Codebook for "How Exiles Mobilize Domestic Dissent" Journal of Politics, forthcoming.

By Elizabeth R. Nugent and Alexandra A. Siegel 


** Citation **

Elizabeth R. Nugent and Alexandra A. Siegel. 2024. "How Exiles Mobilize Domestic Dissent.” The Journal of Politics, forthcoming.

@article{NugentSiegel2024,
author = {Nugent, Elizabeth R. and Siegel, Alexandra A.},
journal = {The Journal of Politics},
title = {{How Exiles Mobilize Domestic Dissent}},
volume = {(forthcoming)},
year = {2024}
}


** Youtube Data **

Data collected using the Youtube API and the tuber R package. 

File: "youtube_data.csv" 
 
1. id 
-- youtube video id (from video metadata) 

2. date
-- date video was posted (from video metadata)

3. viewCount
-- total count of video views (from video metadata)

4. likeCount
-- total count of video likes (from video metadata)

5. dislikeCount 
-- total count of video dislikes (from video metadata) 

6. type 
-- Whether the video contains primarily coordination or opposition information. Videos were manually coded by a native Egyptian Arabic speaking RA and codings were validated by authors. Opposition information criticizes Sisi or the Egyptian government; coordination information describes where/when/how to protest. 

** Google Trends Data ***

Data collected using Google Trends API and the gtrends R package. Search term is the Topic "Mohamed Ali (Egyptian Contractor)". Date range is Sept 1 - Nov 1, 2019. This includes all related queries (regardless of language) and captures a wider range of relevant searches than keywords alone. 

Panel Data File: "gtrends_panel.csv" 

1. date
-- date of relative search interest 

2. egypt_search_interest
-- relative search interest in Egypt (0-100) 

3. global_search_interest
-- relative search interest worldwide (0-100) 

Egypt City Data File: "gtrends_egypt_map.csv" 

1. city
-- Egyptian city

2. governorate 
-- Egyptian governorate 

3. relative_search_interest
-- average relative search interest in each governorate (0-100) 


** Facebook Data **

Aggregated data from Facebook posts referencing Mo Ali or his hashtags (September 1 - November 1, 2019). Data collected using the Crowdtangle API.

Facebook Data File: facebook_data.csv

1. date
-- date post created (from Crowdtangle metadata)

2. in_egypt
-- whether the page administrator is located inside Egypt or not (adapted from Crowdtangle metadata) 

3. posts_per_day
-- aggregated daily number of posts containing mobilization-relevant content or referencing Mo Ali 

4. engagement_per_day
-- aggregated daily number of engagements received by posts containing mobilization-relevant content or referencing Mo Ali 

5. views_per_day
-- aggregated daily number of video views on posts containing mobilization-relevant content or referencing Mo Ali 


** Real-Time Twitter Data **

Dataset of all tweets containing Egyptian politics keywords collected in real time using the Streaming API from September 1 - November 1, 2019. 

Data File: realtime_twitter_data.csv

1. id_str
-- Twitter ID (unique id for each tweet, can be used to rehydrate data) 

2. date
-- date tweet was posted

3. mo_ali
-- dummy variable for whether or not tweet was produced by Mo Ali account 

4. mobilization
-- dummy variable for whether or not tweet contains reference to Mo Ali or his hashtags

5. in_egypt
-- variable that takes the value "Inside Egypt" if self-reported user location data references a location within Egypt, "Outside Egypt" if self-reported user location data references a location outside of Egypt, and NA if there is no interpretable self-reported location metadata. 

6. user.followers_count
-- numeric value in tweet metadata indicating user's follower count at the time of tweeting 

7. user.statuses_count
-- numeric value in tweet metadata indicating user's cumulative number of tweets at the time of tweeting 

8. mobilization_local 
-- dummy variable for whether or not tweet contains reference to an Egyptian locality and a reference to Mo Ali or his hashtags 

9. local_mobilization_governorate
-- English name of governorate of location referenced in mobilization_local 

10. num_protests 
-- number of protests in local_mobilization_governorate 



** Twitter Network Data **


Dataset of users producing tweets mentioning Mo Ali's Twitter handle collected using the Academic Twitter API (September 1 - November 1, 2019). 

Data File: "twitter_network_data.csv" 

1. actor.id 
-- user id variable from Twitter metadata

2. actor.followersCount
-- user follower counts from Twitter metadata 

3. k_core 
-- k core of the network that each user is located in (calculated using graphml R package coreness function). 

** Top 100 Mention Network Data for Visualization ** 

Nodes and Edges datasets for top 100 Twitter users most frequently mentioning Mo Ali. Data comes from dataset of all tweets mentioning Mo Ali's Twitter handle collected using the Academic Twitter API (September 1 - November 1, 2019). 

Nodes Data File: "nodes_top100.csv"

1. Id 
-- node id 

2. Label
-- label for plot (only @mohamedsecrets account is labeled to protect user privacy)

3. indegree
-- node in degree in mentions network

4. actor_type 
-- user type (activist, media, entertainment etc) 

5. egyptian 
-- variable taking the value of "Egyptian" if user is Egyptian, and "Not Egyptian" if not 

6. in_egypt
-- variable taking the value of "Egypt" if user self-reports a location in Egypt and "Outside Egypt" if user self-reports a location outside of Egypt 

Edges Data File: "edges_top100.csv" 

1. Source
-- source node id

2. Target
-- target node id


** Protest Data **
Dataset of protest events in Egypt (2010 - 2020) from Egyptian NGO Dafter Ahwal.

Protest Data File: "protest_data.csv"

1. date
-- protest date 

2. governorate
-- Egyptian governorate where protest occurred

** Human Coded Data **

Dataset of 1000 randomly sampled manually coded tweets referencing Mo Ali and/or his hashtags. Data was filtered and sampled from "realtime_twitter_data.csv" 

Human Coded Data File: "human_coded_data.csv" 

1. id_str
-- tweet id 

2. classification 
-- label tweet as "anti regime", "pro regime", "unclear", or "irrelevant"  

